The Binary Regenerative Channel* 

By RICHARD H. McCULLOUGH 

The nature of the errors in a regenerative digital transmission system 
is such that a memoryless channel is a poor model for predicting the error 
phenomena. In this paper we present a model which provides a reasonable 
approximation to observed error phenomena. The memory of the channel is 
represented by a Markov model. This model is similar to the model de- 
veloped by E. N. Gilbert, but several important modifications greatly 
simplify the estimation of parameters, and make the model correspond 
more closely to the physical phenomena involved. 

Bounds for the channel capacity of the binary regenerative channel are 
obtained. Error separation, block error, and burst statistics are derived. 

Error model parameters are derived from available experimental data 
on the Tl digital transmission line and the switched telephone netivork. 
The Markov model is shown to provide a good representation of the observed 
error phenomena. 

I. INTRODUCTION 

The Gilbert burst-noise channel introduced the idea of error states. 1 
The error states represent different error processes, each of which 
generates independent errors. Gilbert's model yields a "renewal er- 
ror process/' that is, an error process for which the gaps between 
successive errors are independent random variables with the same 
probability distribution. Elliott 2 introduced a generalization which 
yields what we shall call a "Markov error process," that is, an error 
process for which the gaps between errors are dependent random 
variables with probability distributions which depend only on the 
last gap between errors. More recently, Elliott used a renewal error 
process, with component error processes which do not generate in- 
dependent errors. 3 In order to match experimental data for block 



* This paper is based on materia] taken from a dissertation submitted to the 
Polytechnic Institute of Brooklyn in partial fulfillment of the requirements for 
the degree of Doctor of Philosophy in 1967. 
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error distributions, he found it necessary to introduce three empiri- 
cally derived error separation distributions. 

Berger and Mandelbrot proposed a renewal error process in which 
the error separation follows a Pareto distribution. 4 Sussman has used 
the Pareto distribution to model the switched telephone network. 5 

Gilbert's model and Elliott's model both assume that the transition 
probabilities for the error states are independent of the occurrence of 
errors. In this paper we drop that assumption to define a general 
Markov error process. We then consider a particular Markov error 
process in which transitions between error states are allowed only 
when an error occurs. We associate this error process with the 
"binary regenerative channel." Error separation, block error, and 
burst statistics are derived for this latter process. Error model param- 
eters are calculated from available data for the Tl digital transmis- 
sion line, and for the switched telephone network. We discuss briefly 
the possible usefulness of the Pareto distribution for approximating 
a many-state Markov error process, or for approximating a non- 
stationary error process. 

This author extends this model to apply to a ternary channel. 6 

II. ERROR MODEL 

An error model must be able to reproduce the burst error phe- 
nomena which are known to occur in digital channels. Real channels 
seldom appear to be memoryless, and it is common for a large frac- 
tion of the errors to be burst errors. To reproduce the burst phe- 
nomena, we have chosen to use a Markov model similar to Gilbert's. 1 
Our model differs from his in two important aspects. First, we have 
attempted to make the model correspond more closely to the physical 
phenomena involved by introducing several error-producing states, 
each with different error rates. Second, transitions between states are 
allowed only immediately following an error. This assumption greatly 
simplifies estimation of the parameters of the model, since the num- 
ber of digits between adjacent errors is determined by a single error 
state. 

The similarities and differences between these models are most 
easily understood by examining the transitions between error states. 
We shall restrict our present discussion to two-state error processes. 
We define : 

S„ = error state for the nth error digit 
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Z n = nth error digit = 1 for error and for no error 
p u = Prob {S n = ; | S n _ t = i, Z n ., = 0} 
q ti = Prob {S n = j | 2U = {, Z n _ x = 1} 

P { = Prob {Z„ = 1 \X„ = i] = average error probability (for the 
binary symmetric channel) of state i. 

The state diagram of the general Markov model is shown in Fig. 1. 
A renewal error process is obtained if q V} is independent of i (so that 
the next error state is independent of the state which produces the 
error) , or if P x or P 2 is zero (so that only one state produces errors) . 

The Gilbert burst-error process 1 assumes that p ti = q ti and P a = 0. 
The assumption that P 2 = makes this process a renewal error process. 
Elliott's generalization 2 assumes p it = q {i but P 2 9^ 0. This process is 
a renewal error process only if g u = q 2l (and q l2 = 922)- Our model, the 
"binary regenerative channel," assumes that p ti = 5,, , the Kronecker 
delta. This process is a renewal error process only if q u = g 2 i (and 
3l2 = #22) ■ 

Our assumption that state transitions can occur only after errors 
(Pij = fy) seems reasonable for two reasons. First we hold the opera- 
tional viewpoint that all our information comes from the occurrence 
of errors, and we might as well assume that nothing changes between 
errors. This also provides a practical technique for estimating transi- 
tion probabilities from error separation data. Furthermore, this model 
seems to be quite "stable" in that extremely small transition proba- 
bilities are not encountered in practice, so that statistical estimates 
are relatively easy to obtain. 




Fig. 1 — State diagram of general Markov model. 



1716 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1968 

Second, this is exactly the model we would choose for an error process 
consisting of random errors plus signal-correlated errors, where a certain 
fraction of the random errors produce a wake of closely spaced errors. 
In the case of bursts which are not correlated with random errors, our 
physical intuition suggests something like p u = 1, p 12 = and p 2l « 
fti « 1 would be more appropriate. Operationally, however, it really 
does not matter if the first error in a burst is "incorrectly" identified 
as a random error. 

Although the mathematical descriptions of the Gilbert model, the 
Elliott model, and our new model are different, we suspect that the er- 
ror separation, block error, and burst statistics obtained from the 
three models will be quite similar. (We show that the form of the 
error separation statistics is identical for the Gilbert model and our 
new model.) We contend, however, that the new model is more use- 
ful because the parameters of the model are easily determined from 
experimental data and are easier to interpret in terms of physical 
noise processes. 

In the above discussion we have considered the two error states to 
correspond to different physical error processes in a single channel. How- 
ever, this single channel is clearly equivalent to a two channel trans- 
mission system where the "error" state indicates which channel is 
being used. We use this latter interpretation in the next section. Notice 
that the two channels are simply binary symmetric channels with 
different error rates. In practice we have P 2 « -Pi ~ 1/2; therefore, 
we refer to state (channel) 1 as the burst error state (channel) and state 
(channel) 2 as the random error state (channel). 

III. CHANNEL CAPACITY 

Closed form expressions for the capacity of the Markov channel have 
not yet been found* so that we are limited to determining the capacity 
for specific numerical values of the parameters. On the other hand, we 
can find reasonably simple and tight bounds on the capacity which 
are quite useful. Therefore we consider only bounds on the channel 
capacity. 

Let the sequences of input, output, and error digits be denoted by 
X t , Y { , and Z, , respectively, with 7,- = (X { + Z t ) mod 2 and i = 
1,2, • • • . Since the noise sequence is independent of the input sequence, 



* Note that the method used by Gilbert 1 is valid only for a renewal error 
process, and did not yield a closed form solution. 
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the channel capacity is given by 



C = max [H(X) - H(X\Y)] 

P(X) 



= max [H(X) 

p(.Y) 



H(Z)] = 1 - H(Z) 



where H denotes the entropy of the sequence and p(X) is the prob- 
ability of the sequence X. Following Ash 7 we define the entropy of 
the noise sequence by 

H(Z) = lim H(Z n | Z n _j , • • • , Z,). 

n->oo 

We shall find bounds on the channel capacity by bounding the en- 
tropy of the noise, using the steady state probabilities of the Markov 
noise sequence. 

Consider the state diagram in Fig. 1. There are really only four 
states which we shall designate by S { = 11, 10, 21, or 20, i = 1, 2, ... , 
where the first digit indicates which of the two binary symmetric 
channels is being used (2* = 1 or 2) and the second digit gives the 
value of the error digit (Z { = 1 or 0) . Thus the state diagram may be 
redrawn as shown in Fig. 2 (using p y = 8y) . The steady state proba- 
bilities 1'ij are the solutions of the equations 



l/ll^Wjl^oJ — ITll^lU^l^Oj 



9nP. 3n(l - Pi) 912^2 
P, 1 - P l 

52,^! ?2l(l - Pl) 922^2 

P 2 



912(1 - 


-P 2 ) 







g 22 (l - 


-P2) 


1 - 


P 2 _ 



It can be shown that 



r„ = fliPi 

r 10 = Ki(l - Pi) 

r 2i = R 2 P 2 

r 20 = K 2 (l - P 2 ) 



where 



*-*.* 



Qi = 2ii 

9l2 + ?21 



R 2 = P t 



P, 



Q> = 



9l2 
9l2 + §21 
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q„P, q 22 P 2 



CL«(1-P,) 




q 22 0-P 2 ) 



i-P, i-P 2 

Fig. 2 — State diagram of binary regenerative channel. 

1 



P. = 



= R\P\ ~h xi 2 i 2 • 



P, ^P 2 

We notice that Pi and P 2 are the steady state probabilities of 2i, Qi 
and Q 2 are the steady state probabilities of 2* given that Z^i = 1, and 
P e is the steady state probability that Z* = 1 (that is, P e is the av- 
erage error rate) . 

We are now ready to compute upper and lower bounds on H(Z). 
An upper bound is 

H(Z) ^ lim H(Z n | Zn-x) 

= PMQiPi + Q2P2) + (l - ^«) A [i~ £ p- CI - Qi p i - Q2P2) J 

where MP) = -P log P - (1-P) log (1-P). A simpler (and looser) 
upper bound is 

lim#(Z n I Z_0 ^ lim#(Z n ) = h(P.). 

n-»oo n-*oo 

Since S n is determined by a first order Markov process, a lower 
bound is 



H(Z) ^ lim H(Z n I Z n _, , • • • , Z, ; £„_,) 
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= lim H(Z n I &_,) 

n-«oo 

= R 1 P 1 h(QnP i + 9i2-P 2 ) + PiU - PiMPr) 

+ R.P.hiq^P, + g 22 P 2 ) + fi 2 (l - P 2 )^(P 2 ). 

Using the fact that h(P) is a convex function we obtain a simpler 
(and looser) lower bound 

lim H(Z n | £„_,) ^ BiPifonACPi) + 61WO] + #i(l ~ POWO 

n-»oo 

+ « A[«,ifc(Pi) + GwW.)] + B a (i - *W*) 
= BiWi) + R*h(P 2 ) 

= \im H(Z n |X). 

n-»oo 

From the loose bounds we see that the capacity of the binary regen- 
erative channel is greater than the capacity C = 1— h(P e ) of a binary 
symmetric channel with the same average error rate, and is less than 

the capacity C = BJl-MPi)] + B 2 [l-ft(P 2 )] wnicn could be 

achieved if we always knew which component channel was being used. 

A convenient way to describe the channel capacity is to give the 

probability P c of the binary symmetric channel with capacity C, that 
is, H(Z) — h{P c ). From the bounds given above it follows that 

h~ 1 [R l h(P 1 ) + R a h(P 2 )] ^ /T'tlim H(Z n | £„_>)] ^ P e 

n-»oo 

^ fc _I [lim H(Z n | Z n _,)] ^ P. . 

For the practical case where P 2 « Pi w 1/2 and Qi ^ Q 2 P^ 1/2 the 
above inequalities are approximately 

P 2 < P 2 < P c < (1 - QiPJP. < P. 

or 

Q.P. < P e < (l - QiPOP. • 

The loose bounds given above can be generalized to apply to any 
finite number of memoryless, nonsymmetric channels in the form 

BiRrPr + • • • + R m P m ) ^ C ^ R 1 B(P 1 ) + • • • + R m B(P m ) 

where R { is the steady state probability of using channel i and B(P { ) 
is the capacity of channel i. 
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Digital transmission systems may use many digital links, with 
regeneration of the transmitted signal at the end of each link. For a 
system of N identical links where NP 2 « Pi , we can approximate the 
over-all system by a single digital regenerative link with the same q ti 
and P, but with P 3 replaced by P' 2 = NP 2 . This substitution yields 

P: « NP e R[ pa NRr R' 2 « R 2 

which agrees with one's intuitive notion of how the over-all system 
should behave. That is, the number of random errors and burst errors 
both increase by a factor of N, and the length of the bursts remains 
the same. 

IV. ERROR STATISTICS 

We shall now derive error separation, block error, and burst statis- 
tics for the two-state Markov error process. We assume two com- 
ponent error processes which generate independent errors with dif- 
ferent average error rates, P i} i = 1, 2. Transitions between error 
states are allowed only after errors, with the probabilities given by 

q fi = Prob {state i — » state j|last digit was an error}. 

For the error separation statistics we shall make use of several re- 
sults for independent errors. We begin by deriving the basic equations 
for an independent error process. Let P be the probability that any 
digit is in error. The error sequence then contains a 1 with probability 
P and with probability 1-P. Given an error, the probability that 
the next error occurs on the fcth digit is 

p(k) = Prob [C^ l l\l) - Prob {0*- J l} = P(l - P)*" 1 . 
The average error separation is 

k = E kp(k) - P £ fc(l - P)"- 1 = ±- 

«:-i *=i * 

The probability that the number of good digits between errors is 
greater than or equal to n (that is, the error separation is n+1 digits 
or greater) is given by the cumulative distribution 

Q(n) = Prob {k > n) = 1 - Ep(fc) 

k = l 

= i - p i; a - py~ i = (i - p) n - 
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For P « 1 the following approximation is quite useful 

(1 - P) n = e loe(1 - p) tt e- P . 
The probability of getting m errors in n digits is 

P(m,n) = —r^ r-.P'"(l - P) n ~'". 

ml (n — m)\ 

The probability that a block of n digits contains an error is 

P(^ 1, n) = 1 - P(0, n) = 1 - (1 - P) n « nP for P « 1, n « J;- 

Let Q h j = 1, 2, be the unconditional probability of being in state 
; at the first digit following an error. Making use of the results for 
the independent error process, we have 

p(k) = Q^U - P,)*" 1 + Q 2 P 2 (1 - P 2 ) k ~ l 

c _ Qi 4_ Qi = J_ 

P! r P 2 P. 

Q(n) - (Ml - PO" + Q,(l - P 2 ) n 

where P c is the average error rate. 

The expression for P(m, w) for the Markov error process is a very 
complicated function of the parameters of the process. However, the 
form of the dependence upon n is easily found through an appropriate 
set of recurrence relations. The recurrence relations are also useful 
for computing numerical values on a digital computer. 

Let A ( (m, n) be the probability that m errors have occurred (that 
is, m occurrences of state 11 or 21) in n digits and that channel i is 
used for the ?i+lst digit (that is, 2„+i = i)- Then 

P(m, n) = Ai(in, n) + A 2 (fn, n). 

Considering all possible events which may occur at the nth digit we 
obtain the following pair of recurrence relations. 

Ai(m, n) = Ai(m, n — 1)-(1 — Pi) + A x (m — 1, n — \)P x q n 

+ A 2 (m — 1, n — 1)-P 2 g 2 , 
A 2 (m, n) = A 2 (m, n — 1)-(1 — P 2 ) + A a (m — 1, n — 1)-P 2 g22 

+ A x (m — 1, n — 1)-Piqi 2 . 
Solving the equations for successive values of m we find that the 
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general solution has the form' 1 

P(m, n) = G m (?i)(l - P 1 ) n - m + C?*(»)(l - P 2 ) n - m 

where G m (n) is a polynomial of degree to in the variable n, and the 
asterisk denotes a cyclic permutation of the parameter subscripts, 
that is, 1 -> 2 and 2 -» 1. Assuming that the Markov process is in the 
steady state at digit "zero," the first two polynomials are 

G Q (n) = R, 

O l {n) = 2{~f^)R i P 1 q 12 + IRJPitafr. 

It is possible to determine the functional dependence of P (to, n) upon 
n because we assumed that the error state changes only after an er- 
ror occurs. This effectively decouples the set of recurrence relations 
so that A x {m, n) and -A 2 (to, n) can be determined separately. For 
larger m the explicit expressions for the coefficients of G m {n) become 
so complicated that they are of little use. Thus one can only hope to 
gain some insight into the behavior of P (m, n) as a function of to by 
numerical evaluation for a typical case. 

The average number of digit errors in a block of n digits is 



m 



= 2 »P(*i *>) = nP e . 



Given that the block contains one or more errors, the average number 
of digit errors is 



to 
e = 



P(fcl, n) Q 1 [l - (1 - Pi)"]/Pi + &[1 - (1 - P 2 ) n ]/P2 

In practice we usually have P 2 « Pi ~ 1/2 and Bi <C R a « 1. There- 
fore, we have 

P(m, n) tot G*(n)e- (n - m)Pa for n - to » 1/Px . 

Specifically, we find that 

P(0, n) » e _nP * 
P(^l,n)» 1 - e~ nP ' 

*~7T W? l fOT ^»1/P: • 

^2 1- e 

The burst error behavior of the channel is indicated by the num- 
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ber of successive occurrences of state 1, the burst error state. Let 
p(b) = Prob {leave state 1 after b errors|now in state 1} 

= rfpci -GO b- 1,2, ... . 

Notice that 6 occurrences of state 1 implies b + 1 "closely spaced" 
errors. 

In the next section we calculate error model parameters from avail- 
able data on transmission of binary information over the Tl digital 
transmission line, and over the switched telephone network. A three- 
state model is required to provide a reasonable match in some cases. 
Therefore, we digress briefly to generalize our results to apply to a 
three-state model. (Actually, we give the results in a form which is 
suitable for any finite number of states.) 

We assume three component error processes which generate inde- 
pendent errors with different average error rates, P i} i = 1, 2, 3. 
Transitions between error states are allowed only after errors, with 
the probabilities given by 

q t j = Pi'ob {state i — > state j|last digit was an error}. 

Let Qj, j = 1, 2, 3, be the unconditional probability of being in state 
j at the first digit following an error. The Qj are the solutions of the 
following set of equations: 

Qt = QiQu + Q 2 g 21 + Q3Q31 
Q 2 = Qigia + Q 2 q 22 + Q 3 g 3 2 
Qj = Qigia + Q2CI23 + Q3Q33 • 
Corresponding to the previous results we now have 

p(fc) - QJ\(1 - P,)*- 1 + Q 2 P 2 (1 - P 2 ) k ~ 1 + Q 3 P 3 (l - P.)' -1 

* P, P, P 2 ~ P. 

Q(n) = Qi(l - P,)" + Q a (l - P 2 ) n + Q 3 (l - P 3 ) n 
where P e is the average error rate. The recurrence relations become 

Ai(m, 11) = (1 - Pi)il,(»i, n - 1) + Pig u Ai(f?i - 1, n - 1) 

+ P 2 <7 21 4 2 (m - 1, n — 1) + P 3 g 3 iA 3 (m — 1, n — 1) 
A 2 (m, n) = (1 - P a )it a (m, n - 1) + P 2 q 22 A 2 (m — 1, n — 1) 
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+ P z q 3i A a (m — 1, n - 1) + P^A^m — 1, n — 1) 
4 3 (m, n) = (1 - P 3 )A 3 (?tt, n — 1) + P 3 g 3 3^ 3 (w - 1, n - 1) 

+ P^.aA^m — 1, n — 1) + P 2 q 2 iA 2 (m - 1, n - 1) 

where i4,(ra, n) is the probability that m errors have occurred and the 
error process is in state i after n digits. The solution of the recurrence 
relations gives 

P(m,n) = i4,(m, n) + A 2 (m, n) + A a (m,n) 

= GL(n)(l -F 1 ) n - m + G*(n)(l-P a )""" + G**(n)(l - P,)""" 

where G m (n) is a polynomial of degree m in the variable n. The coeffi- 
cients of G m {n) are complicated functions of the model parameters. 
G*(n) is G m (n) with the parameter subscripts cyclically permuted, 
that is, 1 — > 2, 2 — > 3, 3 -> 1. G**(n) is Gt(n) with the same cyclic 
permutation. We again have G (n) = ir!, so that 

P(0, n) = 72,(1 - 1\T + K 2 (l - P 2 ) n + B 3 (l - P»Y 

PCfcl.n) 

= 1 - P(0, n) 



I - (1 - PO" , n 1 - (1 - P,)" ,_ n ] I -P.)" l. 



Q, Pi + Q 2 p 2 +Q 8 p 3 



= P. 

The probability of being in state i at any digit is 



Qi 

p QS< _ Pi _ P Qs 

' Qifc + Q.& + Q.JE. Qi ,&.& ' p . 

Pi "*" Pa P 3 

V. EXPERIMENTAL PARAMETERS 

5.1 Tl Digital Transmission Line 

For the Tl digital transmission line* (see Refs. 8 and 9) , the error 
data was obtained by measurements 10 on three different lines, each 
looped to obtain an equivalent system length of about 24 miles. In 
total, there were five runs of approximately one hour duration, that 
is, about 5 X 10° digits each. The transmitted pattern was 10000000 
repeated. Each run produced about 100 errors. The data were proc- 



* Manufactured for Bell System use only, by Western Electric Co., manufac- 
turing and supply unit of the Bell System, 
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essed in real time with an IBM 7094 computer equipped with a 
direct data device. The results (a sequence of numbers a lt a 2 , . . . 
where a< is the number of good digits between the jth and [j + l]st 
errors) were recorded on a magnetic tape. 

To determine the parameters of the Markov error process model, 
we processed the experimental results as follows. 

(i) Add 1 to each a to get k, the error separation. 
(ii) Classify each k as state 

1 = "burst error state" 1 ^ k < 10 

2 = "intermediate error state" 10 ^ k < 10 3 

3 = "random error state" 10 3 ^ k. 

(Hi) For each state i, find the average error separation, k { = l/P* . 

(iv) From the sequence of states find the relative frequency of 
occurrence of state i, Q ( , and the relative frequency of occurrence of a 
transition from state i to state j, q fi . 

Steps in and iv were carried out for each run individually, and 
with all runs together (considered as one big sample). Table I lists 
the parameters (rounded to two significant digits) which were ob- 
tained by the above procedure. Notice that the conditions qn = q 2 i = 





Table I— 


-Markov Model for Tl 




Run 


QU 


Qt 


Pi 


P. 


1 


. 35 
.43 
.12 


.00 

.07 
.31 


.65 

.50 
.57 


.23 
.19 
.58 


.46 
3.2 X 10~ 3 
.86 X 10~ 8 


1.5 X lO" 8 


2 


.35 
.67 
.24 


.26 

.08 
.17 


.39 
.25 
.59 


.36 
.19 
.45 


.46 
4.0 X 10" 3 
.61 X 10" 8 


1.4 X 10" 8 


3 


.29 
.62 
.24 


.33 
.19 

.17 


.38 
.19 
.59 


.35 
.23 
.42 


.37 
4.9 X 10- 3 
.44 X 10-« 


1.0 X 10" 8 


4 


.53 
.50 

.19 


.22 
.15 

.20 


.25 
.35 
.61 


.38 
.20 
.42 


.53 
3.6 X 10" 3 
1.0 X lO" 8 


2.4 X lO" 8 


5 


.47 
.31 
.08 


.13 

.31 
.14 


.40 
.38 
.78 


.19 

.17 
.64 


.26 
3.7 X 10" 3 
1.0 X 10" 8 


1.6 X 10" 8 


All 


runs together 


.42 
.51 

.16 


.21 

.16 
.20 


.37 
.33 
.64 


.31 
.20 
.49 


.43 
3.7 X 10- 3 
.77 X lO" 8 


1.6 X 10~ 8 
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#31 and qi2 = #22 = Q32 and q^ = #03 = #33 are not satisfied. Hence, 
the Tl error process does not appear to be a renewal error process. 
(This does not mean that we should discard the renewal error process. 
In essence, it is a first approximation to a real error process, the 
Markov error process is a second approximation, and higher order 
Markov processes are higher order approximations. The first approxi- 
mation may be satisfactory in some applications.) Using the param- 
eters of Table I, the validity of the model was checked in three ways. 
First, the theoretical cumulative distribution of the error separation, 

Q(n) = Q,(l - PO" + Q,(l - P 2 ) n + 0.(1 - W 
was plotted for each run. The theoretical and experimental curves 
matched within approximately ±0.05, for all five runs. Typical curves 
are shown in Fig. 3a (semilog plot) and 3b (log-log plot) . Notice that 
we could have derived rough values for the Q t and P, by inspection 
of the experimental Q(n) curve. 
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Fig. 3 — Error separation statistics for Tl digital transmission line. 



REGENERATIVE CHANNEL 



1727 





V 




RUN 


4 








THEORY 

O EXPERIMENT 
(49 EVENTS) 


S ,0 a 

^ 6 
4 

2 
10-2 












"\ 
















„ TWO 


EVENTS 


1 >1 




„ ONE EVENT 










j\ 



I 2 3 4 5 6 7 

b= NUMBER OF SUCCESSIVE OCCURRENCES OF BURST STATE 



Fig. 4 — Burst statistics for Tl digital transmission line. 



Second, to check the burst error behavior of the channel, we compared 
the experimental and theoretical probability densities, p(b), of suc- 
cessive occurrences of the burst error state 

p(b) = &?Q. - ff „). 

The agreement is excellent as illustrated in Fig. 4. (Since our sample 
size is only 49, we should not expect the experimental points to follow 
the theoretical curve for probabilities of about 1/49 PH 0.02.) The ex- 
perimental and theoretical curves matched within approximately 0.02 
in all five runs. Notice that the procedure for calculating q u simply 
provides an exact match at b = 1. 

Third, to check the adequacy of the model for predicting block error 
statistics, we compared the theoretical and experimental (averaged 
over all possible phases) curves for P(m, n). Figures 5 and 6 show 
P(m, n) versus n and ra, respectively. The agreement is excellent for 
m ^ 4. For m ^ 5 the experimental curves are somewhat erratic owing 
to the small sample (the quantum of probability is approximately 
2 X 10" 10 in this case), which happened to contain two unusual error 
patterns. 

The excellent match between the experimental data and the model 
indicates that a three-state Markov error process with independent 
transitions is a good representation of the Tl error process. Since 
this is the case, it is useful to consider a physical interpretation of 
the mathematical model. The three different error states correspond 
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n= NUMBER OF DIGITS PER BLOCK 



Fig. 5 — Block error statistics (digits per block) for Tl digital transmission line. 



to different sources of error, of which only one is controlling at any 
given time. Allowing state transitions only at error digits corresponds 
with the fact that we cannot identify the controlling error process 
except by the error (and error separation) which it produces. 

As for the sources of error, we can make several speculations. Fur- 
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Fig. 6 — Block error statistics (errors per block) for Tl digital transmission line. 

ther experimental data will be required to determine which of the 
suggested possibilities is correct. "Burst errors" may result from signal 
correlated errors (generated in successive regenerators) following a 
random error, or from burst-like interferences such as impulse noise 
(all errors of the burst generated at the same regenerator). "Inter- 
mediate errors" may be caused by looping effects (outgoing and 
incoming regenerators are packaged together and are thus subject to 
the same interference) , or by slowly propagating interferences such 
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as teletype or other dc signaling. "Random errors" are assumed to 
come from thermal noise. 

We have thus far ignored the question of whether the error sequence 
is really independent of the transmitted signal sequence. Table II 
summarizes the available experimental data concerning this point. 
We observe that the average probability of error is roughly 5 to 40 
times greater when the signal digit X t = 0, depending on the number 
of digits since the last 1. (Notice, however, that Table II does not 
really tell us the number of digits since the last 1 because errors 
cause the "signal" to be different in successive regenerators.) Over 
all, the average probability of error is roughly 10 times greater when 
X t = 0. 

What does this imply about our model? First, since there are so 
few errors when Xi = 1, our component channels are nonsymmetric 
and the model parameters derived above essentially apply only when 
Xi = 0. In fact, it is possible that the error rate for X t = 1 is the same 
in all three component channels, so that no burst phenomena occurs 
if Xi = 1 for all i. Second, the dependence of the error rate on the 
number of digits since the last 1 probably results from intersymbol 
interference. This suggests that bursts might very well be signal cor- 
related errors which are generated in successive regenerators, in which 
case the average length of a burst should increase with the number of 
regenerators. Unfortunately, the available data are not sufficient to 
verify or disprove these conjectures. 

How do we correct our model to take into account the data presented 
in Table II? As a first approximation we would replace the three com- 
ponent binary symmetric channels with memoryless nonsymmetric 
binary channels with error probabilities P a , P 2 , and P 3 for X t = 0, 
and P' lt Pa, and P' 3 for X { = 1. With the limited data available the 
best we can do is to use the previously calculated values for P t ,P 2 , and 
P 3 , and let (using the figures for all runs) 

P[ =p' 2 =p' 3 = Tfr(1.6 X 10" 8 ) = 1.9 X 10 -10 . 

The computation of channel capacity and error statistics now becomes 
more difficult because we must consider the joint probability densities 
of the source and channel. However, we can still use the bounds for 
channel capacity given at the end of Section III. 

To get any better approximation we must replace the three channels 
with three nonsymmetric binary channels with memory. The memory 
would contain d, the number of digits since the last 1, and could probably 
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be limited to three states: d = 1, d = 2, and d ^ 3. (As already dis- 
cussed this memory would also generate burst phenomena so that we 
might possibly require only two channels for this model.) This approach 
is intuitively appealing for modeling the effects of intersymbol inter- 
ference, but we should use ternary channels with memory because the 
Tl digital transmission line actually transmits ternary signals. 

The fact that the average error probability is always greater for 
Xi = (that is, even for d > 3i probably is because of long term 
intersymbol interference, which in the case of Tl may persist for 
hundreds of digits. This interference is approximately proportional to 
the running sum of the digits TF, = X r + X 2 + . . . + X t . The Bipolar 
Code used in the Tl digital transmission system guarantees that Wi 
can assume only the values or —1 in the absence of errors.* We 
assume that the output of each regenerator is recoded into Bipolar 
so that the Wi satisfy the same constraint in successive links, and 
the channel can be described using a finite memory. Recoding allows 
one to localize errors to a particular digital link and reduces the 
error rate in successive links. If the output is not recoded, the Wi 
are theoretically unbounded which requires an infinite memory to 
describe the channel. 

To summarize our thoughts on the Tl error process, we may say 
the following. The Markov model analyzed in the preceding sections 
of this paper provides a good representation of the signal-independent 
error phenomena, and reproduces all the gross error statistics. The 
extension of the model suggested in this section shows promise of 
providing a good representation of the signal-dependent error phe- 
nomena, and should reproduce the fine grain error statistics; addi- 
tional data are required to determine the parameters and validity of 
the suggested extension. Notice that the signal-dependent memory is 
realized as a simple Markov process when the source digits are inde- 
pendent random variables. 

5.2 Switched Telephone Network 

We now consider the error model for the switched telephone net- 
work. Gilbert 1 has shown that a two-state Markov model provides a 
good approximation to the cumulative error separation distribution 
for an individual digital channel. Although Gilbert used a different 
model, his theoretical results for error separation are identical in 



* McCullough 6 treats the general class of ternary restricted sum codes for 
which the digit sum is bounded (—a ^ TFi ^ b) for every code sequence. 
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form to the results of Section IV, that is, 

Q(n) m 0,(1 - PO" + Q 2 (l - P 2 ) n . 

We notice, however, that if we choose Q, and P, so that the error 
separation distributions are identical, the "equivalent" binary regenera- 
tive channel will have a higher average error rate. When P <g.hq Gilbert's 
equation (14) becomes 



Q(n) 
so that 



- Q±H)<W + fe) (1 - P) " 



or 



P x « 1 - hq 
P 2 ^P 

^ 2 ~ Q - hq 
1 -P 1 



h 



1 - Q 2 Pi 

IT rZj ±2 

V ~ Q2P1 



and 



rw ~ 1 - Q 2 P! Q 2 ~ 1 - Q 2 Px ' * 
In our notation, the parameters for his examples (see Gilbert's Fig. 
3) are 

Channel 1146: Qi = P, arbitrary 

Q 2 = 1, P 2 = 5.4 X 10" 3 

Channel 1296: Q, = 0.816 P x = 0.190 

Q 2 = 0.184 P 2 = 2.57 X 10" 3 . 

For an average of many digital channels, a three-state Markov 
model can provide a reasonable numerical fit. Qi and P t were deter- 
mined for samples of the Alexander- Gryb-Nast, 11 Townsend-Watts, 12 
and Kelly 13 data on the error performance of the switched telephone 
network. Table III lists the parameters, which were determined by 
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Table III — Markov Model for Switched Telephone Network 
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trial and error matching of the Q(n) curves. It should be obvious 
that the Qi and P< were quantized rather coarsely. Figure 7 shows 
that the maximum difference between the experimental and theore- 
tical curves is about ±0.05. 

Although the numerical fit is reasonably good, it is evident that 
the sharp transition of a single independent-error process is not a 
good match to the gradual slope of the experimental curves at larger 
error separations. However, the experimental curves represent an 
average over many different channels. The parameters of the model 
will vary from channel to channel, resulting in an over-all error process 
which contains many states. Each state will have a small probability 
of occurrence (Qi) and a slightly different average error probability 
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(Pi). Q(n) for such an error process will exhibit a gradual slope at 
larger error separations. 

It is obvious that we could use a larger number of states in the 
Markov model, and match the experimental Q(n) curves to any 
desired degree of accuracy. At this point we should consider whether 
it is an individual channel or an average of many channels that we 
wish to match. Usually it will be the former, in which case a three- 
state (or perhaps even a two-state) Markov model will be satisfac- 
tory. If it is the latter, it makes sense to seek a single (noninde- 
pendent) error process which provides a better match for the gradual 
slope observed at larger error separations. A likely candidate is the 
Pareto distribution proposed by Berger and Mandelbrot. 4 They also 
give statistical evidence which supports the renewal error process 
hypothesis. Sussman 5 has shown that the Pareto distribution pro- 
vides a good fit to the Alexander-Gryb-Nast data. It is interesting 
that Sussman hypothesized that the Pareto distribution may be the 
limiting form of "the superposition of many unrelated error-causing 
events," which is exactly what our model suggests. 

To incorporate the Pareto distribution into our model, we would 
represent the cumulative distribution of error separation as 

Q(n) = Q x (l - PxT + Q*(n + l) _a 

where a is a parameter which would be chosen so as to give the best 
match to the experimental data. It should be recognized that the 
above distribution will not be a good approximation for the Markov 
error process for very large values of n. As n -> oo the Markov dis- 
tribution approaches 

Q(n) -> Q„(l - P M ) n 

where Q m and P x describe the channel with the smallest average 
random error rate. 

In some situations the Pareto distribution may also be a good 
representation of an individual channel. We have implicitly assumed 
stationary channels. A nonstationary channel whose parameters vary 
rapidly with time is essentially equivalent to the average of a large 
number of stationary channels, each with different parameters. Such 
a model may be appropriate for digital communication systems using 
radio links. On the other hand, a slowly varying nonstationary chan- 
nel is essentially equivalent to a single stationary channel, since the 
parameters will not change appreciably during any message of rea- 
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sonable length. This kind of model appears to be appropriate for 
digital communication systems using paired cable or coaxial cable. 
To summarize, we feel that the Markov model is a good representa- 
tion for the error process of an individual digital channel. The Markov 
model also explains the observed measurements for the average of a 
large number of digital channels, and leads naturally to the idea of 
using the Pareto distribution to approximate the behavior of a 
Markov error process with many states. 
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